18 research outputs found

    Efficient Decompression of Binary Encoded Balanced Ternary Sequences

    Get PDF
    International audienceA balanced ternary digit, known as a trit, takes its values in {-1, 0, 1}. It can be encoded in binary as {11, 00, 01} for direct use in digital circuits. In this correspondence, we study the decompression of a sequence of bits into a sequence of binary encoded balanced ternary digits. We first show that it is useless in practice to compress sequences of more than 5 ternary values. We then provide two mappings, one to map 5 bits to 3 trits and one to map 8 bits to 5 trits. Both mappings were obtained by human analysis and lead to Boolean implementations that compare quite favorably with others obtained by tweaking assignment or encoding optimization tools. However, mappings that lead to better implementations may be feasible

    HLS-Based Methodology for Fast Iterative Development Applied to Elliptic Curve Arithmetic

    No full text
    International audienceHigh-Level Synthesis (HLS) is used by hardware developers to achieve higher abstraction in circuit descriptions. In order to shorten the hardware development time via HLS, we present an adjustment of the Iterative and Incremental Design (IID) methodology, frequently used in software development. In particular, our methodology is relevant for the development of applications with unusual complexity: the method was applied here to the development of large modular arithmetic, commonly used for cryptography applications (e.g., Elliptic Curves). Rapid feedback on circuit characteristics is used to evaluate deep architectural changes in short time, greatly reducing the time-to-market with respect to hand-made designs. In addition, our approach is highly flexible, since the same generic high-level description can be used to produce an entire set of circuits, each with different area/performance trade-offs. Thanks to the proposed approach, any change to the initial specification (e.g., the curve used) is also very fast, while it may require a large effort in the case of hand-made designs

    High-level synthesis for fast generation of hardware accelerators under resource constraints

    No full text
    Dans le domaine du calcul gĂ©nĂ©rique, les circuits FPGA sont trĂšs attrayants pour leur performance et leur faible consommation. Cependant, leur prĂ©sence reste marginale, notamment Ă  cause des limitations des logiciels de dĂ©veloppement actuels. En effet, ces limitations obligent les utilisateurs Ă  bien maĂźtriser de nombreux concepts techniques. Ils obligent Ă  diriger manuellement les processus de synthĂšse, de façon Ă  obtenir une solution Ă  la fois rapide et conforme aux contraintes des cibles matĂ©rielles visĂ©es.Une nouvelle mĂ©thodologie de gĂ©nĂ©ration basĂ©e sur la synthĂšse d'architecture est proposĂ©e afin de repousser ces limites. L'exploration des solutions consiste en l'application de transformations itĂ©ratives Ă  un circuit initial, ce qui accroĂźt progressivement sa rapiditĂ© et sa consommation en ressources. La rapiditĂ© de ce processus, ainsi que sa convergence sous contraintes de ressources, sont ainsi garanties. L'exploration est Ă©galement guidĂ©e vers les solutions les plus pertinentes grĂące Ă  la dĂ©tection, dans les applications Ă  synthĂ©tiser, des sections les plus critiques pour le contexte d'utilisation rĂ©el. Cette information peut ĂȘtre affinĂ©e Ă  travers un scĂ©nario d'exĂ©cution transmis par l'utilisateur.Un logiciel dĂ©monstrateur pour cette mĂ©thodologie, AUGH, est construit. Des expĂ©rimentations sont menĂ©es sur plusieurs applications reconnues dans le domaine de la synthĂšse d'architecture. De tailles trĂšs diffĂ©rentes, ces applications confirment la pertinence de la mĂ©thodologie proposĂ©e pour la gĂ©nĂ©ration rapide et autonome d'accĂ©lĂ©rateurs matĂ©riels complexes, sous des contraintes de ressources strictes. La mĂ©thodologie proposĂ©e est trĂšs proche du processus de compilation pour les microprocesseurs, ce qui permet son utilisation mĂȘme par des utilisateurs non spĂ©cialistes de la conception de circuits numĂ©riques. Ces travaux constituent donc une avancĂ©e significative pour une plus large adoption des FPGA comme accĂ©lĂ©rateurs matĂ©riels gĂ©nĂ©riques, afin de rendre les machines de calcul simultanĂ©ment plus rapides et plus Ă©conomes en Ă©nergie.In the field of high-performance computing, FPGA circuits are very attractive for their performance and low consumption. However, their presence is still marginal, mainly because of the limitations of current development tools. These limitations force the user to have expert knowledge about numerous technical concepts. They also have to manually control the synthesis processes in order to obtain solutions both fast and that fulfill the hardware constraints of the targeted platforms.A novel generation methodology based on high-level synthesis is proposed in order to push these limits back. The design space exploration consists in the iterative application of transformations to an initial circuit, which progressively increases its rapidity and its resource consumption. The rapidity of this process, along with its convergence under resource constraints, are thus guaranteed. The exploration is also guided towards the most pertinent solutions thanks to the detection of the most critical sections of the applications to synthesize, for the targeted execution context. This information can be refined with an execution scenarion specified by the user.A demonstration tool for this methodology, AUGH, has been built. Experiments have been conducted with several applications known in the field of high-level synthesis. Of very differen sizes, these applications confirm the pertinence of the proposed methodology for fast and automatic generation of complex hardware accelerators, under strict resource constraints. The proposed methodology is very close to the compilation process for microprocessors, which enable it to be used even by users non experts about digital circuit design. These works constitute a significant progress for a broader adoption of FPGA as general-purpose hardware accelerators, in order to make computing machines both faster and more energy-saving

    Génération rapide d'accélérateurs matériels par synthÚse d'architecture sous contraintes de ressources

    No full text
    ISBN : 978-2-11-129186-7Even if FPGA circuits are very attractive for their performance and low power consumption, their usage as hardware accelerators is still marginal. Indeed, the existing development tools are only accessible to users with expertise in circuit design. In order to reduce their limits, a novel generation methodology based on high-level synthesis is proposed. By iteratively applying transformations to an initial solution, the process rapidly converges and strictly respects hardware constraints, particularly the available ressources. A demonstration tool, AUGH, has been built, and experiments have been launched with several known applications. THe proposed methodology is very close to the compilation flow for microprocessors, which allows it to be used even by users with no expertise about digital circuit design.Bien que les FPGA soient trĂšs attrayants pour leur performance et leur faible consommation, leur emploi en tant qu'accĂ©lĂ©rateurs matĂ©riels reste marginal. Les logiciels de dĂ©veloppement existants ne sont en effet accessibles qu'Ă  un public expert en conception de circuits. Afin de repousser leurs limites, une nouvelle mĂ©thodologie de gĂ©nĂ©ration basĂ©e sur la synthĂšse d'architecture est proposĂ©e. En appliquant des transformations successives Ă  une solution initiale, le processus converge rapidement et permet de respecter strictement des contraintes matĂ©rielles, notamment en ressources. Un logiciel dĂ©monstrateur, AUGH, a Ă©tĂ© construit, et des expĂ©rimentations ont Ă©tĂ© menĂ©es sur plusieurs applications reconnues. La mĂ©thodologie proposĂ©e est trĂšs proche du processus de compilation pour les microprocesseurs, ce qui permet son utilisation mĂȘme par des utilisateurs non spĂ©cialistes de la conception de circuits numĂ©riques

    Génération rapide d'accélérateurs matériels par synthÚse d'architecture sous contraintes de ressources

    No full text
    ISBN : 978-2-11-129186-7Even if FPGA circuits are very attractive for their performance and low power consumption, their usage as hardware accelerators is still marginal. Indeed, the existing development tools are only accessible to users with expertise in circuit design. In order to reduce their limits, a novel generation methodology based on high-level synthesis is proposed. By iteratively applying transformations to an initial solution, the process rapidly converges and strictly respects hardware constraints, particularly the available ressources. A demonstration tool, AUGH, has been built, and experiments have been launched with several known applications. THe proposed methodology is very close to the compilation flow for microprocessors, which allows it to be used even by users with no expertise about digital circuit design.Bien que les FPGA soient trĂšs attrayants pour leur performance et leur faible consommation, leur emploi en tant qu'accĂ©lĂ©rateurs matĂ©riels reste marginal. Les logiciels de dĂ©veloppement existants ne sont en effet accessibles qu'Ă  un public expert en conception de circuits. Afin de repousser leurs limites, une nouvelle mĂ©thodologie de gĂ©nĂ©ration basĂ©e sur la synthĂšse d'architecture est proposĂ©e. En appliquant des transformations successives Ă  une solution initiale, le processus converge rapidement et permet de respecter strictement des contraintes matĂ©rielles, notamment en ressources. Un logiciel dĂ©monstrateur, AUGH, a Ă©tĂ© construit, et des expĂ©rimentations ont Ă©tĂ© menĂ©es sur plusieurs applications reconnues. La mĂ©thodologie proposĂ©e est trĂšs proche du processus de compilation pour les microprocesseurs, ce qui permet son utilisation mĂȘme par des utilisateurs non spĂ©cialistes de la conception de circuits numĂ©riques

    Génération rapide d'accélerateurs matériels par synthÚse d'architecture sous contraintes de ressources

    No full text
    In the field of high-performance computing, FPGA circuits are very attractive for their performance and low consumption. However, their presence is still marginal, mainly because of the limitations of current development tools. These limitations force the user to have expert knowledge about numerous technical concepts. They also have to manually control the synthesis processes in order to obtain solutions both fast and that fulfill the hardware constraints of the targeted platforms.A novel generation methodology based on high-level synthesis is proposed in order to push these limits back. The design space exploration consists in the iterative application of transformations to an initial circuit, which progressively increases its rapidity and its resource consumption. The rapidity of this process, along with its convergence under resource constraints, are thus guaranteed. The exploration is also guided towards the most pertinent solutions thanks to the detection of the most critical sections of the applications to synthesize, for the targeted execution context. This information can be refined with an execution scenarion specified by the user.A demonstration tool for this methodology, AUGH, has been built. Experiments have been conducted with several applications known in the field of high-level synthesis. Of very differen sizes, these applications confirm the pertinence of the proposed methodology for fast and automatic generation of complex hardware accelerators, under strict resource constraints. The proposed methodology is very close to the compilation process for microprocessors, which enable it to be used even by users non experts about digital circuit design. These works constitute a significant progress for a broader adoption of FPGA as general-purpose hardware accelerators, in order to make computing machines both faster and more energy-saving.Dans le domaine du calcul gĂ©nĂ©rique, les circuits FPGA sont trĂšs attrayants pour leur performance et leur faible consommation. Cependant, leur prĂ©sence reste marginale, notamment Ă  cause des limitations des logiciels de dĂ©veloppement actuels. En effet, ces limitations obligent les utilisateurs Ă  bien maĂźtriser de nombreux concepts techniques. Ils obligent Ă  diriger manuellement les processus de synthĂšse, de façon Ă  obtenir une solution Ă  la fois rapide et conforme aux contraintes des cibles matĂ©rielles visĂ©es.Une nouvelle mĂ©thodologie de gĂ©nĂ©ration basĂ©e sur la synthĂšse d'architecture est proposĂ©e afin de repousser ces limites. L'exploration des solutions consiste en l'application de transformations itĂ©ratives Ă  un circuit initial, ce qui accroĂźt progressivement sa rapiditĂ© et sa consommation en ressources. La rapiditĂ© de ce processus, ainsi que sa convergence sous contraintes de ressources, sont ainsi garanties. L'exploration est Ă©galement guidĂ©e vers les solutions les plus pertinentes grĂące Ă  la dĂ©tection, dans les applications Ă  synthĂ©tiser, des sections les plus critiques pour le contexte d'utilisation rĂ©el. Cette information peut ĂȘtre affinĂ©e Ă  travers un scĂ©nario d'exĂ©cution transmis par l'utilisateur.Un logiciel dĂ©monstrateur pour cette mĂ©thodologie, AUGH, est construit. Des expĂ©rimentations sont menĂ©es sur plusieurs applications reconnues dans le domaine de la synthĂšse d'architecture. De tailles trĂšs diffĂ©rentes, ces applications confirment la pertinence de la mĂ©thodologie proposĂ©e pour la gĂ©nĂ©ration rapide et autonome d'accĂ©lĂ©rateurs matĂ©riels complexes, sous des contraintes de ressources strictes. La mĂ©thodologie proposĂ©e est trĂšs proche du processus de compilation pour les microprocesseurs, ce qui permet son utilisation mĂȘme par des utilisateurs non spĂ©cialistes de la conception de circuits numĂ©riques. Ces travaux constituent donc une avancĂ©e significative pour une plus large adoption des FPGA comme accĂ©lĂ©rateurs matĂ©riels gĂ©nĂ©riques, afin de rendre les machines de calcul simultanĂ©ment plus rapides et plus Ă©conomes en Ă©nergie

    High-Efficiency Convolutional Ternary Neural Networks with Custom Adder Trees and Weight Compression

    No full text
    International audienceAlthough performing inference with artiicial neural networks (ANN) was until quite recently considered as essentially compute intensive, the emergence of deep neural networks coupled with the evolution of the integration technology transformed inference into a memory bound problem. This ascertainment being established, many works have lately focused on minimizing memory accesses, either by enforcing and exploiting sparsity on weights or by using few bits for representing activations and weights, so as to be able to use ANNs inference in embedded devices. In this work, we detail an architecture dedicated to inference using ternary {−1, 0, 1} weights and activations. This architecture is conngurable at design time to provide throughput vs power trade-oos to choose from. It is also generic in the sense that it uses information drawn for the target technologies (memory geometries and cost, number of available cuts, etc) to adapt at best to the FPGA resources. This allows to achieve up to 5.2k fps per Watt for classiication on a VC709 board using approximately half of the resources of the FPGA. Additional Key Words and Phrases: Ternary CNN, low power inference, hardware acceleration, FPGA ACM Reference format: Adrien Prost-Boucle, Alban Bourge, and FrĂ©dĂ©ric PĂ©trot. 2018. High-EEciency Convolutional Ternary Neural Networks with Custom Adder Trees and Weight Compression

    Fast and Standalone Design Space Exploration for High-Level Synthesis under Resource Constraints

    No full text
    International audienceThe very high computing capacity available in the latest Field Programmable Gate Array (FPGA) compo- nents allows to extend their application fields, in High-Performance Computing (HPC) as well as in embedded applications. This paper presents a new methodology for Design Space Exploration (DSE) in the context of High Level Synthesis (HLS) for HPC and embedded systems targeting FPGAs.This new methodology provides very quickly an RTL description of the design under resources constraints. An autonomous flow is described, that performs incremental transformations of the input design description. The low complexity of the transformation evaluation, decision and exploration algorithms, associated with a greedy progression, makes this DSE methodology very fast. Moreover, this methodology respects a strict resource constraint given as bare FPGA primitive amounts. Hence, the generated design fits into the targeted FPGA or a partition of it. Such a methodology leads to autonomous, fast and transparent DSE, all these issues known to limit the use of HLS.Results on several benchmarks highlight the capabilities of our DSE methodology. The results show a high generation time speed-up compared to one other existing HLS approach, while preserving correct performance of the generated circuits
    corecore